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INFORMATION MANAGEMENT SYSTEM 



FIELD OF THE INVENTION 

The present invention relates to apparatus and methods for computerized 
information management 



BACKGROUND OF THE INVENTION 

Conventional, systems for computerized information management are 
described at tiie following Internet websites: 
www.clickmarks.com 
www.verity.com 
www.octopus.com 
www.snippets.com. 

The disclosures of all publications mentioned in the specification 
and of die publications cited therein are hereby incorporated by reference. 
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SUMMARY OF THE INVENTION 

The present invention seeks to provide improved systems and methods 
for information management usefiil for managing multiple dynamic electronic 

information sources. 

The system of the present invention preferably includes a 
complete information management system operative to allow users to organize, 
store, access, search, annotate, share, distribute, monitor and analyze multiple 
dynamic electronic mformation sources. Typically, the system mcludes 
multiple synergistic components that can be used individually or in conjunction 
with one another to achieve synergism of tiie components. 

There is tiius provided, in accordance with a preferred 
embodiment of the present invention, an information management system including a 
plurality of information sources, and an information source previewer operative to 
provide a preview of the information sources including a less than complete 
view of at least some of the information sources. 

Also provided, in accordance with another preferred embodiment of 
the present invention, is an infonnation management system mcluding at least one 
representations of information sources, a graphical user interface integrated vwth at 
least one of the representations of the mformation sources, and an archiving 
system operative to allow users to time-stamp and archive at least one 
representations of information sources. 

Further in accordance with a preferred embodiment of tibie present 
invention, flie archiving system is operative to allow remote archiving. 

Still furfhCT in accordance with a preferred embodiment of the present 
invention, the archiving system includes an annotator. 

Additionally in accordance val3x a preferred embodiment of the 
present invention, the graphical user interfece allows a user to specify vAich of a 
plurality of other users can access the content and how long content is to be 
stored. 

Also provided, m accordance with another preferred embodiment of 
the present invention, is an information management system including an archiving 

2 



wo 01/69448 



PCT/USOl/07567 



system operative to allow users to time-stamp and archive content, and a 
scheduling system allowing the archiving system to operate automatically in 
accordance with a predetermmed schedule. 

Further in accordance with a preferred embodiment of the present 
invention, the schedulmg system operates the archiving system in accordance with 
at least one triggering rule. 

Further in accordance witti a preferred embodiment of the present 
invention, the scheduling system is operative to perform a watch function in 
which predefined content is watched for. 

Also provided, in accordance with yet another preferred embodiment 
of the present invention, is an information management system including a content 
searcher, a search-defining GUI allowing a user to define a search, and a 
watch-defining GUI allowing a ws&c to define a watch at least by automatically 
converting a previously defined search into awatcL 

Additionally provided, in accordance with anoflia: preferred 
embodiment of the present invention, is an information management system including 
a content se£m:her and a search-defining GUI allowing a user to define at least 
freshness of search. 

Further provided, in accordance with another preferred embodiment 
of the present invention, is an information management system including a content 
searcher and a search-defining GUI allowing a user to define at least depth of 
search. 

Also provided, in accordance with anpth^ preferred embodimmt of 
the present invention, is an information management system including a content 
searcher and a search-defining GUI allowing a user to define at least duration of 
search. 

Further proidded, in accordance with still another preferred 
embodiment of the present invention, is an informatibn manag^ent syistem including 
an information source manager including a set of user-defined information sources, 
a content search^, and a search-defining GIH allowing a user to define a 
subset oftfaeus^-defined information sources to be searched. 

Additionally provided, in accordance with another piefisrred 
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embodiment of the present invetition, is an information management system including 
a server storing user-defined folders, and a client via which a user can view at least 
some of the user-defined folders. 

Also provided, in accordance with another preferred embodiment of 
the present invention, is an information management system including at least one 
representations of information sources including graphic representation of 
check-update status, and a check-update status inaintaiaer operative to monitor the 
check-update status of each information source and to maintain the graphic 
representation of the check-update status accordingly. 

Further provided, in accordance with still another preferred 
embodiment of the present invention, is an information management system including 
a search results GUI including a plurality of separate result windows for separate 
search results. 

Also provided, in accordance with still another preferred 
embodiment of the present invention, is an information management system including 
a docimient portion identification GUI operative to allow a user to graphically 
identify a portion of a document using a targeted set of questions, and a docimient 
portion processing unit operative to perform at least one process on a document 
portion defined by a user via the document portion identification GUI. 

Further in accordance with a preferred embodiment of the present 
invention, the system is operative to perform a search ov^ a specific part of an 
information source. 

Also provided^ in accordance witii another pref^ed embodimrat of 
the present invention^ is a information management system, including a plurality of 
ii^ormation management tools, an information source, and a GUI (graphic user 
interfiace) integrating the plurality of information management tools around the 
information source using a graphical representation. 

Furtii^ in accordance with a preferred embodiment of the present 
invention, at least one of tiie information sources is selectably accessed via a locally 
stored copy thereof ratiier than directiy. 

Still fijrdier m accordance with a preferred embodiment of the present 
invention, the scheduling system perfonns the watch function over a user-defined set 
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of information sources and over a user-defined time period. 

Further in accordance with a preferred embodiment of the present 
invention, the scheduling system includes a notifier operative to notify a user of "hits", 
the notifier employing any of a plurality of user-selectable notification modes. 

Also provided, in accordance with still another preferred 
embodiment of the present invention, is an information management system mclucting a 
watch unit operative to watch for a defined unit of information in a flow of 
information, and an EL A unit 

Further in accordance with a preferred embodiment of tiie present 
invention, the system is operative to perform an ongomg search over a specific part of 
an information source. 

Also provided, in accordance with a preferred embodiment of the 
present invention, is an mformation management system including an update checkmg 
unit,and anELAunit 

Further in accorcbnce with a preferred embodiment of the present 
invention, the system is operative to perform an ongoing update-check over a 
specific part of an information source. 

Still fiirther in accordance with a preferred embodim^t of the present 
invention, the document portion processing vmii is programmable to perform 
customized fimctions, thereby to allow a user to perform customized processes on 
specific document portions. 

Additionally in accordance with a preferred embodiment of the 
present invention, the client displays multiple sources simidtaneously. 

Further in accordance with a preferred embodiment of the present 
invention, the client operates within a standard web browser without downloading and 
installing specialized software. 

Still furtho: in accordance with a preferred embodiment of the present 
invention, the seardi results GUI diq)lays a fist of results and, simultaneously, tiie 
results th^nselves in separate windows. 

Also provided, in accordance with a preferred embodiment of tiie 
present invention, is an information man^^ent system including a functional imit 
operative to p^orm a plurality of selectable fimctions on information, and an 
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automatic information retriever operative to automatically retrieve information from a 
plurality of information soiirces. 

Further in accordance with a preferred embodiment of the presrat 
invention, the automatic information retriever is selectably operative to automatically 
retriever information on a condition-triggered basis. 

Still further in accordance witii a preferred embodiment of the present 
invention, the system also includes an ELAunit 

Further m accordance vn&i a preferred embodiment of the present 
invention, multiple user-selectable notification methods are employed to bring system 
work products to a user's attention. 

Still further in accordance with a preferred ^bodiment of the 
present invention, the system also includes an inter&ce allowing mobile access to and 
control of the system. 

Also provided, in accordance with a preferred embodiment of the 
present invention, is an information managem^t system including an information 
source processor operative for performing user-selectable information management 
processes on any user-selectable information source from among a plurality of 
information sources, and an ELA interfece constructed and operative to allow a user to 
identify specific elements of documents as information sources. 

Further in accordance with a preferred embodiment of the present 
invention, the specific elements which a user is allowed to identify include at least one . 
of the following group: image, phrase, table, sub-table, line, , cq)tion, cell, row, 
column, item, list, paragraph, firame. 

Additionally in accordance with a preferred embodiment of the present 
invention, the ELA interface is operative to group several elements in a document 

Still fiirther in accordance witii a prefored embodiment of the present 
invention, the ELA interface is operative to contiguously groiq) several elements in a^ 
document 

Additionally in accordance with a preferred embodiment of the present 
invention, the ELA interfiice is operative to non-contiguously group several elements in 
a document 

Further in accordance with a preferred mbodiment of the present 
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invention, a group of at least one elements may be identified by means of a 
combination of at least one internal properties. 

Still further in accordance with a preferred embodiment of the present 
invention, a group of at least one elements may be identified by means of their 
relationships to other elements having a specified combination of at least one internal 
properties. 

Further in accordance with a preferred embodiment of the presOTt 
invention, the internal properties include at least one of the following group: contains 
a specified text, possesses at least one descriptive formatting property, contains 
specified markiq>T-tag information. 

Still fiirther in accordance mth a preferred embodiment of the 
present invention, the at least one descriptive formatting property includes at least one 
of the following group of property types: a color property, a size property, and a style 
property. 

Furth^ in accordance with a preferred embodiment of the present 
invention, flie relationships include at least one of the following type of rclation^ps: 
after, before, between, contained in, location in group, bigg^, biggest in group, first, 
smallest, largest 

Also provided in accordance wifli a prefi^red raibodiment of the 
present invention are methods for implementing and employing the systems shown 
and described herein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present inventioii will be understood and appreciated from tiie 
following detailed description, taken in conjunction with the drawings in which: 

Fig. 1 is a sunplified pictorial iUustratiGn of a screen display of an 
"add folder" interface constructed and operative in accordance with a preferred 
embodiment of the present invention which is useful in implementing a Folder View 
functionality provided in accordance with a preferred embodiment of the present 
invention; 

Fig. 2 is a simplified pictorial illustration of a screen display of a 
"folder view" interface constructed and operative in accordance with a preferred 
embodiment of the present invention; 

Fig, 3 is a simplified pictorial illustration of a screen display of an 
"add source" interface constructed and operative m accordance with a preferred 
embocUment of the present invention; 

Fig. 4 is a detailed illustration of an individual one of the Topic 
Windows (such as Window 230) illustrated in the screen disi)lay ofFig.2,in a first, 
Web, mode useful in implementing a Topic Window fimctionality provided in 
accordance with a preferred embodiment offhe present invention; 

Fig. 5 is a detailed illustration of an individual one of the Topic 
Wmdows (such a$ Window 230) illustrated in the screen display of Fig. 2, in a 
second, Not^, mode useful in implementing the Topic "^mdow functionality, 
^accessed by clicking the Notes button 430 m Fig, 4; 

Fig. 6 is a simplified pictorial illustration of a screen display of a 
"SHOW NOTE" interface constructed and operative in accordance with a preferred 
. embodiment of tiie present invention, ttiat appears when clickmg on an individual 
note listiing 520 in mode 2 (notes) of a topic wndow, such as that shown in Fig. 5; 

Fig. 7 is a detailed illustration of an individual one of the Topic 
Windows (such as Window 230) illustrated in tiie screen display of Fig.2,in a third. 
Watch, mode useful m unplementing tiie Topic Wmdow fimctionality, accessed by 
clicking the Watch button 440 in Fig. 4; 

Fig. 8 is a detailed illustration of an individual one of tiie Topic 
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Windows (such as Window 230) illustrated in the screen display of Fig. 2, m a 
fourth, Archive, mode useful in unplementing the Topic Window fimctionality, 
accessed by clicking the Archive button 450 in Fig. 4; 

Fig, 9 is a simplified pictorial illustration of a screen display of an 
"search" interface, accessed through the menus 210 at the top of the screen display in 
Fig. 2, constructed andoperatiye in accordance with a preferred embodiment of 
the present invention; 

Fig. 10 is a simplified pictorial illustration of a screen display of 
an "search results" interface, accessed by cntwing information in the search interfece 
and selecting the "Search" button in Fig. 9, constructed and operative in accordance 
with a preferred embedment of the present invention; 

Fig- 11 is a simplified pictorial illustration of a screen dis^jlay of a 
"watch" interfiw^, accessed through the menus 210 at tiie top of the screen display in 
Fig, 2, constructed and operative in accordance with a preferred embodimoat of 
the present invention; 

Fig. 12 is a simplified pictorial illustration of a screen display of a 
"Add Note" interfece, accessed through the menus 210 at the top of the screen display 
in Fig. 2, constructed and operative in accordance with a preferred 
embodiment of the present invention; 

Fig. 13 is a simplified pictorial illustration of a screen display of an 
"archive" interface, accessed through the menus 210 at the top of flie screen display 
in Fig, 2, constructed and op^atiye in accordiance with a preferred 
embodiment of the present invention; 

Fig. 14 is a simplified pictorial illustration of a screen display of a 
"scheduled archive" interface, accessed through the menus 210 at the top of the screen 
display in Fig. 2, cpnstructed and operative in accordance with a preferred 
emboditnmt of the present invention; 

Fig. 15 is a simplified pictorial illustration of a screen display of an 
"import folder" interfece, accessed by pressing the "imyport" button m the screen 
cUsplayof Fig. 1, constructed and operative in accordance with a preferred 
embodiment of the present invention; 

Fig. 16 is a simplified pictorial illustration of a screen display of an 

9 



wo 01/69448 



PCT/USOl/07567 



"search for folder to import" interface, accessed by pressing the "search" button in the 
screen display of Fig. 1, constructed and operative in accordance with a 
preferred embodiment of the present inventior^ 

Fig. 17 is a simplified pictorial illustration of a screen display of an 
"hnport information source" inter&ce, accessed by pressing the "unport" button in the 
screen display of Fig. 3, constructed and operative in accordance with a 
preferred embodiment of flie present invention; 

Fig. 18 is a simplified pictorial illustration of a screen display of an 
"search for information soiurce to import" interface, accessed by pressing the "search" 
button in the screen display of Fig. 3, constructed and operative in accordance with 
a preferred embodiment of the present invention; 

Fig. 19 is a simplified pictorial illustration of a screen display of a 
typical web page that contains multiple elements, and that serves as an example of 
identifying elements within information sources by the use of element level access 
(ELA), in accordance with a preferred embodiment of the present invention; 

Fig. 20 is a simplified flowchart of a preferred method for 
implementing the ELA interface, in which arrows indicate a typical order of 
operations, accessed througji the menus 210 at the top of the screen display in Fig. 2, 
constructed and operative in accordance with a preferred embodiment of the 
present invention; 

Fig. 21 is a simplified fimctional block diagram of a client-server 
implementation of an information management system constructed and operative in 
accordance with a preferred embodiment ofthe present invention; 

Fig. 22 is a simplified fimctional block diagram of a preferred 
implementation of the server 21 10 of Fig. 21; 

Fig. 23 is a simplified fimctional block diagram of a preferred 
implementation for the portfolio service block 2220 of Fig. 22; 

Fig. 24 is a simplified flow chart of a preferred method for 
implementing tiie Content S^ce block 2210 of Fig. 22, in vMch arrows 
indicate a typical Older of op^tions; 

Fig, 25 is a simplified data flow diagram showing preferred data 
flow to the content service block 2210 of Fig. 22; 

10 



wo 01/69448 



PCT/USOl/07567 



Fig. 26 is a simplified control flow diagram showing preferred control 
flow to the content service block 2210 of Fig. 22; 

Fig. 27 is a simplified flow chart diagram showing preferred order 
of operations of the Contait Idmtifier block 2570 of Fig. 25; and 

Fig. 28 is a simplified flow chart diagram showing the preferred 
order of operations of the Picture Renderer block 2560 of Fig, 25. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Reference is now made to Fig. 21 which is a simplified 
functional block diagram of an information managanent system constructed and 
operative ia accordance with a preferred embodiment of the present invention. 
Typically, information is stored by the system using a hierarchical structure. Portfolios 
(one per user) contain folders (zero or more per portfolio), which in tum contain 
information sovirces (zero or more per folder). Information sources are displayed by 
means of topic windows, which appear in the Folder View display described 
below. 

The system of the present invention preferably processes and/or displays 
infomiation in information units termed portfolios, folders and information sources. 
Each of these terms is now described in detail. 

Portfolios 

Each user of the system is assigned a portfolio. A portfolio stores 
the information of a particular user. Using a gr^hical user inteifece (such as 
Fig. 1), the user may add or remove folders fix)m the user's portfolio. Fig. 2 shows 
a portfolio containing four folders, as viewed in flie Folder View (escplained below). 

" Folders 

Folders contain gro\q>s of related information sources^ each 
represented by a topic window. Folders may contam other folders and/or information 
sources. Using a graphical user interface, (such as Fig. 3) the user may specify one 
or more mformation sources or folders to add or remove firom a folder. Each 
information source in a folder is represented by a topic window, defined below. 

Information Sources 
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An infonnatioii source may comprise any electronically stored 
information that is accessible by the system. Examples of information sources 
include, but are not limited to: Web documents located on the Internet or a 
local Intranet, files i5)loaded to the system or available to the system via a 
network file system, archives, notes stored in the system, email folders, schedule 
managers, any information stream or data feed coming &om a local or remote source* 

Using a graphical user inter&ce the system typically allows the user 
to specify an entire document as an information source, or alternatively the user 
may identify a specific portion or portions of a document as an information source. 
The process of identifying specific elements of documents as information sources 
is known as Element Level Access (ELA) and is described below in the section 
"Element Level Access". 

Using the system of the present invention, the user may access 
information sources v^ose access is controlled by security measures. For 
example, the system of the present invention may be constructed and operative to 
access information sources that require a usemame smd password. Using a 
graphical user inter&ce, the user may enter the appropriate security information 
(e.g. usemame and password) into the system. The system typically stores the 
security information and is tiien able to use it to automatically access the secure 
information source. 

The system of tiie present invention fypically provides one, some or all 
of the following functionalities: 

Folder View — e.g. as in Fig. 2, 

Topic Window (Modes 1, 2, 3, 4) - e.g. as in Figs. 4, 5, 7 and 8, 
Information Source Preview, Monitoring — e,g. as in Fig. 4, 

Search— e.g. as in Figs. 9 and 10, 
Watch - e,g. as in Fig. I I, 

Notification, Annotation (Notes and Files)— e.g. as in Figs. 6 and 12,. 

Stor^e: Arcfai>dng — e.g. as in Figs. 13 and 14, 

Collaboration (Groins and Sharing), Mobility /Access to system (GUI, 
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text, WAP mterfaces) - e.g. as in Fig. 23, 

Element Level Access - e.g* as in Figs. 19 and 20, and 
Functions and Analysis. 

Each of the above fonctionalities is now described in detail with 
reference to the figures designated above. 

Folder View 

Infonnation available through the system typically maybe viewed in a 
number of ways. When accessing the system v/ith a standard web browser, 
information may be displayed using the Folder View (Fig. 2). In the folder view, the 
contents of a specific folder in a xiser's portfolio are displayed. Each of the 
information sources within the specific folder is displayed in a topic window 
230, 231, 232, 233, 234, 235 described in fiirther detan below. The topic 
wmdows are ^ically displayed in a grid inside the main window of the web 
browser being used. 

Topic windows are by defeult displayed in mode 1, in &e 
illustrated embodiment, resulting in a user display winch as indicated by reference 
niraierals 230 - 235 of Fig. 2, comprises a grid ofi^ature gr^hical renditions 
of the information sources in the folder. For ^cample, if the infonnation 
sources arc HTML documents such as fliose found on the Worid Wide Web, they are 
preferably rehdercd by the system into a miniature version of what a user would 
usually see in a standard web browser. This results in the equivalrat of having 
many small web biov/sers tiled across the screen, each showng an individual 
information source. This view provides the user wifli a way to view graphically 
multiple information sources simultaneously. Using a graphical user interface, the 
user may specify the arrangement of the topic windows within the folder vieWj 
including but not limited to: tiie size of each of tiie topic windows m the grid, and 
the number of rows or columns tiiat are displayed. For example, a set of six topic 
windows in a folder may be displayed 3 x 2 as in Fig. 1, or 2x3 or 6x1, etc. 

In tiie folder view, a list of folders m the user's portfolio may also 
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be displayed in the browser window. Using a suitable graphical user interfece, 
(such as the folder buttons 220, 221, 222, 223 of Fig. 2) the user may choose 
viiich folder's contents are to be displayed in the folder view. 

In the folder view, a user may access various other functionalities 
of the system through a graphical user inter&cesuch as aset of menus or buttons 
210 of Fig. 2) &at also appear in the browser window. 

Preferably, the screen display of Fig. 2 serves a main screen and the 
menu in Fig. 2 typically allows the user to select any of a plurality of menu options 
corresponding to various functionalities of the system, sudi as the following menu 
options: 

Adding functionalities: Add, add information source, add folder, add 
note, add watch, add archive, add scheduled archive, add analysis. 

Resetting functionalities: Reset (clears hordes that indicate information 
content changes. Reset folder. Reset portfolio. 

Other functionalities: Editing, Display preferences (including editing 
of rows and colximns e,g, 3 x 2 or 6 x 1 of folder view). Search, Do Search, Groups, 
Edit Groups, and Edit Sharing. 

Topic Windows: As shown in Figs. 4, 5, 7 and 8, each information 
source in the user's portfolio typically appears in a topic window. A schematic 
representation of a possible implementation of a topic window is shown in Fig, 4. 
Using a graphical user interface, the user may toggle a topic window between 
one of four modes, numbered i, 2, 3, arid 4. Buttons 420, 430, 440^ 450 for 
toggling betweenniodes are shown at the top of the topic window. Hie name of 
the information source is shovm at the bottom of the topic window 480. The 
information displayed in the central area of the topic window 410 depends on the 
mode that the topic window is irL Four modes of each topic window provided in 
accordance with a preferred embodiment of the preset invention are now 
described. 

Topic \Wndows - Mode 1 fWeb Mode^: As shown in Fig. 4, mode 1 is 
accessed by clicking on the "web" button 420 in a topic window (Fig: 4). In tins 
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mode, a miniaturized graphical representation of the information source is displayed 
m the central area 410 of the topic window. For example, if tiie information source 
is a web page, a miniaturized graphical rendition 470 of the web page is displayed in 
the central area 410 of the topic vwndow. Clicking on the central area 410 of a topic 
window in mode 1 causes the actual mformation source represented by the topic 
wmdo w to appear in the mam window of tiiie browser, replacing the folder view (Fig. 
2), This mechanism provides, the user with an intuitive graphically-based method 
of accessing various information sources with a single click of the mouse. 

As described m the ''Monitoring" section below, the system 
typically continually monitors information sources for changes. When an 
mformation source has changed since the most recent time it was accessed by a 
particular user, a graphical indication (for example, a colored border 460) 
appears around tiie picture in ttie topic window representing that information 
source in the portfolio of tiiat user. When the user clicks on tiie picture 470 to access 
the information source, the graphical indication 460 is removed. 

Typically, the colored border 460 is present whenever the information 
source has changed since the most recent time tiie usct has accessed tiie information 
source. 

Reference numeral 470 indicates a picture of the information source 
shown in tiie central area of the Topic Window. 

Topic Window - Mode 2 (Notes Mode): A s shown in Fig. 5, tiie Notes 
Mode (Mode 2) is accessed by clicking on flie "notes" button 430 in a topic window. 
In fliis mode, a list of notes assigned to tiie information source appears in the central 
area 510 oftiie topic window. Notes are annotations or files created by hscts and 
assigned to specific information sources, as described in tiie "Annotation" section 
below. Clicking on ttie row of words tiiat refer to a specific note in tiie central area of 
tiie topic window of Fig, 5 causes tiie contents of tiiat specific note to be displayed 
m a separate window (Fig. 6) on tiie user's screem. For example, if a user clicks on 
'•Note 3 Jim 5:45 PM Support" 520 m Fig. 5, a separate window will q)pear 
displaying tiie contents of the corresponding note. Using a graphical user interfece, 
users may delete notes from within mode 2 of a topic window. 

Topic Window - Mode 3: As shown in FiR. 7, tiie Watch Mode (Mode 
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3) is accessed by clicking on the "watch" button 440 in a topic window. Watches are 
ongoing searches that are created by users and assigned to specific information 
sources, as described in the "Watch" section below. In this mode, a list of watches 
cunently assigned to the infonnation source appears in the central area 710 of the 
topic window. An example of a list of 2 watches currently assigned to one 
infonnation source is illustrated in Fig. 7. Using a gr2q)hical user inter&ce, watches 
may also be deleted from within mode 3 of a topic window. 

Topic Window - Mode 4: As shown in Fig. 8, the Archive Mode 
(Mode 4) is accessed by clicking on the "archive" button 450 in a topic 
window. Archives are time-stamped and annotated versions of information 
sources are preferably stored by the system on behalf of viscxs and assigned to 
specific information sources, as described in the "archives" section below. In this 
mode, a list of archives assigned to the infonnation source appears in the central area 
810 of the topic window. Clicking on flie row of words (Such as "Archive 1 Jon 
4:55pm Earnings" 820) that refer to a specific arcMve in the central area of the topic 
window in mode 4 causes that specific archive to be displayed in the web browser 
window, replacing the folder view. Using a graphical user interface, archives may 
also be deleted from within mode 4 of a topic wndow. 

Information Source Preview 

An information sources preview is a larger view of the graphical 
rendition that appears in mode 1 of the topic window. A user? may use a graphical user 
interface to cause an information source preview to appeax inside the folder view 
(for example, by positioning the mouse pointer over the name of the infonnation 
source that appears at the bottom of the topic window). The iijfonnation source 
preview is large enough to allow the user to read or view some or all of the 
infonnation coiitained in the information source. 

Since this graplucal rendition is typically abeady pre-rendered on 
the system, the preview typically appears without having to wait for the user's 
machine access the infonnation source directly . In the case of remote irdformation 
somces such as documents on the World Wide Web, an information source 
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preview typically allows the iiser quicker access to the information than would 
otherwise be achievable by accessmg flie information source directly through 
the browser. Using agrapMcal user interface, the user may enable or disable the 
infomiation source preview functionality and modify its display properties (for 
example, its size). 

Monitoring 

The system preferably monitors information sources on an ongoing 
basis, and notifies users of any significant changes. On an ongoing basis, the 
system typically accesses all the information sources in all of the users' portfolios 
in order to check for modifications to the content of tiie information sources. The 
system typically detects changes by comparing the latest version of the information 
source content with the most recently stored version of the information source content 
The system preferably notifies the users vAxo have the information source in their 
portfolio of any significant changes. 

The system typically uses filters (described in the Content Identifier 
section below) to determine whether changes to the content are significant or 
insignificant Examples of filters include but are not limited to: filters that ignore 
changes relating to the time and date, advertisements, or counters that rqport the 
number of visitors to a web site. See the "Content Identifier" section below. The 
operation of the Content Identifier 2570 is described m fiirther detail in Fig. 27. 

One method of notifyingtheuser of a change is a colored border 460 
that s^pears around the graphical rendition of the information source in Mode 1 of 
the topic window. Another mettiod of notification is a graphical indicator that 
appears in the button representing the folder 220-223 tiiatcontams the information 
source that has changed. This latter methoii is usefiil in that it allows a user to be 
notified when an information source has changed soine^ere in the portfolio 
that is outside of the folder cuirentiy being viewed m the folda: view. 

Change notifications are typically maintained by the system Qn 
the Portfolio Database 2320, described below) on a per-user, per-mformation 
source ba^: The system preferably keq)s trackof when each user accesses each 
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specific information source. A change notification is displayed to a specific user 
only when a specific information source has changed more recently than that 
specific user has accessed that specific information source. 

The system typically allows the user to clear the change notifications 
on an entire folder or the entire portfolio. This is useful when the user has not accessed 
the folder or the portfolio for an extmded period of time during vdiidi many of the 
information sources have changed. The user may then wish to clear all the 
change notifications that have accumulated and only be notified ofdianges that 
occur fix>m that point in time onwards. 

Search 

As. shown in Figs. 9 and 10, the system .typically allows a user to 
identify specific information of interest through the use of the search functionality. 
Using a graphical user inter&ce such as that of Fig. 9, the user may specify 
multiple parameters vdien setting iQ) a search. 

The search Isacms define tiiie pattern of information to be searched for. 
This may include individual words, phrases, and Boolean eaqpressions (for example 
"(Earnings AND Sales) OR (Year End Report) AND NOT (Quarterly)"). 

The user may also specify the search domain. The seardi domain is the 
information source or set of information sources to be searched. The search 
domain may be selected, for example, &om any groiq) of information sources or 
folders within the user's portfolio. 

The user may also specify the search deptii, which controls how 
many levels the system fypically branches off of an information source included in 
the search domain to otl^er information sources that are not necessarily included in 
the search domain. For example, if a certain page on the World Wide Web is included 
in the search domain, a search depth of one typically dkecis the system to not only 
search the said page itself, but also to search other pages that flie page refers to 
through hyperlinks. A search depth of two typically directs the system to furth^ 
search all pages referred to by tiie pages refeired to by the said page, and so forfL 

A user may also specify tiie degree of search fieshness. Thesysteni 
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can typically reduce the time it takes to perform a search by searching through 
pre-cached, or locally stored, versions of the content instead of taking the time to 
access all the various information sources directly at the time of tiie search. This 
pre-cached mformation is typically stored by the system on a regular basis in the 
content database (descaibed below), in order to perform the update checking 
functionality. However, the stored versions of tiie content may not be completely up 
to date with the content in the live mformadon sources thwnselvw. Since 
information sources may constantly be changing, it may be desirable for users to 
ensure that the system is searching recent, up-to-<iate versions of the information 
source contents. By letting the user dictate whetiier stored or live versions of the 
content are to be used, the system typically allows a user direct control over the 
tradeoiBf between the freshness of content being searched, and the speed with which the 
search is being performed. 

The user may also specify the results format, including the level of detail 
in which the search results arc displayed. For example, the user typically may direct 
the system to display only the names of the information sources that contain results 
matching tiie search terms. (For example, when searching for information about 
"India" within a folder containing ten news web sites, only three may match: 
"CNN, MSNBC and ABCNEWS report matches to the search"). Alternatively, the 
user may direct the system to display actual selections from the matching content 
in addition to tiie name of the mformation sources that contauned the 
matching content (For example: "CNN: Mudslide in India, MSNBC: India reports 
economic foxwast ABCNEWS: Indiahas mudslide"). 

When the search is complete, the results are displayed in a separate 
window (the "results listing vwndow") (Fig. 10). that appears above flie main 
browser window. By eliciting on the individual result listings in the results 
listings window, the corresponding information sources are displayed in the main 
browser window. This allows &e user to view simultaneously tiie listing of results 
as well as tiie results tiiemselvcs. This functionality provides tiie user with an 
added level of convenience over the commonly implemented interface in 
which eitiier the results or the listings may be viewed, but not both at ttie same time. 

After a search is complete, tiie user is given the option of 
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automatically converting a search into a watch, described below. This saves the user 
the time of re-entering the information to set up a similar watch- 

Watch 

As shown in Fig. 11, a user may configure tibie system to perform a 
watch. A watch is an ongoing search for infonnation matching a specific pattern, 
perfonned oyer a specific period of time. When setting \xp a watch, users can 
speciUP/ all the same parameters as when setting up a seardi, as described above in the 
section "search". In addition, usmg a graidiical user inter&ce, the user can specify 
the duration of the watch, and the notification method (Fig. 11). The duration 
may be specified as any length of time, at the end of which the watch is completed 
and no more searching takes place. During the course of the watch, the content is 
checked at regular intervals, according the configuration of the system as 
described in the "Content Retriever" section below. The notification methodology 
may be selected from one of the notification methods available to the system, as 
described below in the "Notification" section. 

For example, a user may want to find out ^^ether or not a set of 
companies (whose web sites are contained in a folder called "Companies") are 
reporting their corporate earnings during flie course of a particular week. The user 
may set up a week-long watch for the words "Earnings" within the folder 
"Companies". As the week progresses, the system preferably continually checks 
the various information sources within this folder, and not^es the user using the 
desired notification method (for example, fax) if and when the word "Earnings" 
appears in any of the sources. 

Notification 

To allow users maximum access to the system fi'om wherever the 
user may be, any device with which the system can communicate preferably xnay be 
used for notifying the user. Examples include, but are not limited to, on-screen 
notification (such as a colored border or other graphical indication Avitilin, for 
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example, mode 3 of the topic window)^ notification through an e-mail message to an 
email address or addresses that are pre-specified by flie user, notification 
through an Instant Messaging protocol, notification through a commonly 
available paging device, notification using a messaging system (such as SMS) to a . 
mobile phone or mobile device, notification to a fax machine at a telephone 
number pre-specified by the user, notification to a printer pre-specified by the user. 

Using a gmphical user interface, tiie user may enter into the syst^ 
any information the system may use to communicate with the various devices on which 
the user wants to receive notifications. Examples include but are not limited to: 
Email addresses, telephone numbers, etc. 

Annotation: Notes and files 

As shown in Figs. 6 and 12, the system preferably allows users to 
annotate information sources in various ways. Notes (allow a user to assign a text 
message to an information source or group of information sources. Using a 
graphical user interface (Fig. 12), a user may specify a subject or tide for the note, 
indicate the status of the note (for example, "urgent" or "please reply"), compose 
the body of the note (typically a textual message) and indicate to which 
information source or sources in the user's portfolio the note should be assigned 

A user may also use the system to iq)load any type of file 
accessible from the us^s machine and assign it to an information source 
or groq) of information sources. Notes and files assigned to an information 
source are typically stored on the server 2110 of the system (seethe "Architecture" 
section below) and may be viewed through mode 2 of the topic window 
representing that information source. Using the collaboration and sharing 
capabilities of the system, described below in tiie "Collaboration" section, 
users may share notes ajid files with other us^ or groups of users. 

Storage 



As shown in Figs. 13 and 14, the system also typically provides 
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integrated storage ciapabilities for information sources. Using a graphical user 
interface (Fig. 3), a user may direct the system to archive a particular 
information source or set of information sources. A system typically creates an 
archive of an information source by locally storing in the content database a 
time-stamped copy of the current version ofthemformation source contents. A user 
may intf cate which specific information source to archive, the period of time for 
\vhich the archive should be kept on the system before being deleted, and a name to 
assign to a particular archive. Archives are stored in the content database (see tiie 
"Architecture" section below) and may be accessed tinough the Archive Mode (Mode 4) 
of the topic window representing the particular information source. Archives are 
useful for users who may, in the future, wish to access content which is no longer 
available on the information source which provided that content originally. 

The system may also be configured for scheduled archiving, in 
which a user indicates, using a gr^hical user interface, (Fig. 14) a specific pomt in 
time, or specific points in time, during \^diich an information source should be 
archived by the system. The user may also indicate an archiving fi:equency to 
direct the system to archive an information source or sources at regular intervals. 
A user may also specify a set of conditions (see the "Functions" section below) 
that, if matched, will trigger the archive to be created. With scheduled archiving, the 
user preferably does not have to be present at the time of archiving to direct the system 
to create the archive. 

Collaboration 

The system typically provides integrated collaboration 
capabilities. Usmg a graphical user inter&ce, users may create groups. Groups may 
include users and/or other groups. Groups may represent a set of users that may 
have certain interests in common. Groups are useful when combined witii the 
sharing functionalities of the system. 

The system typically provides integrated sharing functionalities. 
Using a graphical user interface, a user adding a resource (a resource is an 
information source or a folder containing information sources) to tiie system has the 
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ability to control which other users or groups have access to the resource, as well 
as what type of access each user or group has ("access level"). For example, a 
group of users may be configured to only be able to read a resource, but not 
change it Other examples of access levels include, but are not limited to: full 
permissions, add permissions, delete permissions, annotate permissions, read-only 

pemiissions, no permissions. 

Using a graphical user interface, a user wanting to access a shared 
resource may import the shared resource into the user's own portfolio OFigs. 15 and 
17). If the user is unsure of the name of the resource or of the name of the user that 
created the resource, the user may search for the resource to unport using a graphical 
user interface (Figs. 16 and 18). An imported resource is added to the user's portfolio 
and the user may interact with it in a way that is determined by the access level set 
for that user for that resource. 

Using the sharing functionality, groups of users can share 
resources. Some useful examples of sharing include, but are not limited to: Shared 
folders where one user assembles a set of relevant information sources and other 
users benefit firom the useful collection of information sources; Shared notes 
where users can conduct a discussion relatmg to a particular information source 
or set of information sources; shared notifications w*iere one user sets up a watch 
and other users benefit firom the notification resulting firom the watch. 

Mobility / Access to the system 

The system typically provides users access to the system firom 
anywhere on any device. The primary method of interacting with the system is 
typically the graphical user interface 2330 of Fig. 23, accessible through a standard 
Web Browser and described in the sections above. To access thesystem in this 
manner, the user typically employs a computer wifli commonly available standard 
web browser software installed and a connection to a networic tiirough vMch the 
serverof the system is accessible. There is no need for a user to download or install 
any additional software on the local machine, allowing the user a high degree of 
mobility relative to systems where specific software (other tiian a standard web 
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browser) needs to be installed on the local machine in order to access the 
functionalities of the system. 

The system may also be accessed through a text interface 2340. In this 
interfece, all the graphical user interface components of the system (such as those 
mentioned in die descriptions above) may be rq)laced by equivalait tsact-only 
interfaces. This interface is useful for users accessing the system over a low 
bandwidth connection that would otherwise involve slower interaction times 
(between the user and the system) if using the standard graphical user interfece. 
The slower interaction times would be due in large part to flietime it would take to 
download the grq)hical interfece components fix)mtties«ver to flie User's computer. 

The system also i^calfy has the capabilities to be accessed by 
mobile devices, examples ofwhich include, but are not limited to PDAs and mobile 
telephones. Special interface modules are designed in Ae system to handle the 
specific protocols of these devices. For exan^le, a WAP (Wireless 
Applications Protocol) interfece module typicaUy aUows access to the systein from 
any WAP-enabled de\dce 2350. 

Security measures arc ^ically provided for users accessing the 
system. Using a graphical user interfece, the system typically pnanpts the user for a 
user name and password before aUowing access to a particular portfolb. Using a 
gr^hical user interface, a yser may also change the password that controls acc6ss to 
said user's portfolio. Users may also access the system through secure 
communication protocols. Examples include but are not limited to ht^s. 

Elqnent Level Access: Interfece 

As shown in Figs. 19 and 20 and as described above, the user may use a 
graphical user interface to identify a specific element of a document accessible by 
the system for use as an information source in the user's portfolio. 

The user may identify specific eleihents within a document 
Examples of elements include, but are not limited to: table; cell; row; column; 
image; list item; list; line; paragM^ fiame; any region of text distinguishable fram 
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its surroundings by font size, style, color or other properties. The user may also 
select groups of two or more elements, whether or not they are contiguous in the 

document. 

Specific elements may be described^ in a number of ways, 
including, but not limited to: 

1. Contained or nearby text Examples include, but are not limited 
to: The cell that contains the text "Last Trade"; The row that appears after the 
words "Minutes remaining"; The table that appears before the words "Summary 
Statistics". 

2. Markup tags surrounding the element Examples include, but are 
nothmitedto: <font si2e24> ... </font>; <foo>...</foo> containing "bar". 

3^ By structure. Examples include, but are not limited to: The second 

column ofthe fourth table; an image of a certain size. 

4. Combinations of the above. Examples include, but are not limited 

to: the cell containing "Last trade" in Ae table containing "Stock 3". 

An example is shown in Fig. 19. Document A contains two tables B and 
C. Both tables contain stock quotes for the stocks RHAT and AKAM respectively, 
nie name ofthe stocks are located in the cells D and F respectively. TTie last trade 
valiKS are located in cells E and G respectively. 

la die example, the user wants to track fixe last trade value for the 
stock RHAT. information stored in ceU E. It is not enough for the user to specify "the 
cell contaming the textLastTrade" because that matches both cells E and G. The usdr 
thus must specify also that the desired cell is contained in a table that also contains 
Uie text "RHAT". This uniquely identifies Cell E. 

A preferred process for identifying a user-selected part of a 
document is Ulustrated in Fig. 20. Steps 2010 - 2080 in Fig. 20 are now described in 
detail. 

Step 2020; Using a pointing device, tiie user clicks or dr^ on a 
rendered version ofthe document to choose the region tiiat is of interest to the user. 
The system typically graphically mdicates the smallest structural element in the 
document that corresponds to die point or region selected by tiie user. The user may 
try clicking or dragging multiple times, until die satisfactory result is achieved. 

26 



wo 01/69448 



PCT/USOl/07567 



Each time, the system typically graphically indicates the element that the user has 
selected In the example, the user selects cell E. 

Step 2030: The user is given the option to enlarge the selected 
element until the user is satisfied that the selected element encompasses the region 
of interest to the user. In the example, the user does not need to enlarge the region. 

Step 2040: The system typically asks the user to identify the 
important property or properties of the selected element that distinguish it from oth^ 
- namely, what it is about the selected element tibat the user is actually int^ested in. 
Examples may include, but are not limited to: The element contains a specific string, 
or a markup tag, or an image of a certain size. The system may also generate mi 
present possibilities to the user on what distinguishes the desired element &om Ihe 
others. In the example, tiie user indicates that the selected cell is special in that it 
contains the text "Last Trade". 

Step 2050: The system then typically detennines the smallest element 
including the selected area whidi matches the criteria from step 2040. The system 
then typically coimts how many levels "up" ("uplevels") are necessary from that 
smallest element to reach the element selected in step 2030. Uplevels are defined 
below in the section (ELA Engine). This does not apply in tiie example, since there 
are zero up-levels. 

Step 2060: The system then typically attempts to determine if the 
criteria assembled so far xmiquely identify the element on the page. This is done by 
finding all elraients on the page that are the same number of uplevels from other 
elements that match the criteria from step 2040. If there are no other matches, the 
criteria are considered sufBcientiy unique for tiie present time and thed:gorithm 
concludes. If there axe other matches, the system indicates them graphically to the 
user. In the example, botii cells E andG match the current description at this staged 
So cell E is the desired region, but cell G is shown as another candidate matdi. 
The user still needs to distinguish between cell E and cell G. 

Step 2070: The syst^ asks the user why the desired region is 
dififerent fiom the other matching regions, using the same kinds of criteria as in step 
2040. At tiiis stage, the user is looking only at element characteristics within the 
desired re^on. The user may dipose to skip to tiie next step, if the user wishes 
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all matches to be selected, or if Ihe distinguishing characteristics are 
outside the selected regions. If this step isn't skipped, go back to step 2060. In 
the example, the distinguishing characteristics are located outside the selected 
region E, so the user skips this step. 

Step 2080: Now, the user can specify distinguishmg characteristics 
located in elements around but not in the desu:ed region. Start graphically indicating 
the region that is "up" one level from the desired element, as well as regions that are 
"up" from the other matching elements. In the example, the user goes one level up 
from die selected cell E, to the containing Table B. However, since cell G is also a 
candidate, the containing Table C is also indicated. 

The system asks the user vAxst inside tiie graphically indicated desired 
region distinguishes it from the other graphically indicated matching regions. 
Step 2080 is repeated for the various desired regions, removing the matching regions 
which are not selected by the new criteria. When complete, the user can go back to 
step 2080 or is done. In die example, the user specifies that the containing 
region (table B) around the selected element (Cell E) is distinguishable m tiiat it 
contains the text "RHAT". This criteria distinguishes Table B from Table C (which 
does not contain the text RHAT), and in turn, distmguishes the contained cell E from 
the contained cell G, and so the user stops at this pomL 

Functions and Analysis 

The system typically provides ustts with the capability to 
perform various types of analysis on the information accessible by tiie system. 
Examples mclude, but are not limited to: determining whether a parficdar stock 
price is over a certam value, determining how jnany new press releases appear in a 
certain list, determinmg whether a stock is rated as "STRONG BUY" or "BUY", 
comparing two prices and returning die hi^CT of the two, etc. 

When configuring the system for analysis, tiie user may specify the 
following parameters: 

1. The information source or sources to be used as inputs in the analysis - 

This may be any information source accessible by the system, including any elements 
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identified by a user, or any documents stored by the system in tlie content database, 

2. The function to be used for the analysis. Functions are described 
below. 

3. The timing • the analysis may be configured to occur once, or any 
number of times, beginning immediately or at a specified time or times, or at regular 
intervals. The user may also indicate when the system should access new copies 
of the contents of information sources. 

4- The output - a function may ou^ut its results to one or more of a 

number of output targets. Hiese include, but are not limited to: output to a file 
system (such as to the contrait database, described below), output to the user 
through one of the system's notification channels (see Notification" section 
above), output to another fimction. 

Functions may be chained - a user may configure the system to first 
analyze information witii one function, and then in turn analyze the resulting 
output with anotiier function. This chaimng preferably may be done indefinitely. 

Functions allow users to perform multiple types of analysis on the 
mfonnation accessible Ccom the system. Using a graphical user interface, a user 
may select bom a set of functions when configuring the system to perform an 
analysis. Examples of the types of functions available include, birt are not limited to: 

1. Mathematical functions (+,V>*> niax, min, etc.) 

2. Textual functions (length, alphabetize, etc.) 

3. Boolean fimctions (AND, OR, NOT, XOR, etc.) 

4. Grouping functions (0, etc.) 

5. Search functions (grep, find, etc.) 

6. Comparison functions (<,>,<>,<,= =<,==>, etc.) 

The system typically comprises an Applications Programmer Interface (API) that 
allows the set of fimctions available to the system to be extended. This way, 
the system may be further customized for users with specialized needs. For 
example, financial users mayareate a function that performs a linear regression on a 
set of values. Scientific users may create a fimction tiiat paforms a statistical 
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analysis on scientific data. 

A prefcared implementation of a system synergistically providing all 
of the above functionalities is now described. Ardhitecturally, the system is typically 
implemented in two main parts, tiie server 2110 and the client 2120 of Fig. 21. 
Most of the functionality is typically implemented in software running on 
commonly available computer hardware — sudi as a computer wifli a Paitium 
m processor, running a Linux operating system - hereafter refiaied to as the server. 
A user typically accesses the server over a digital communications network fix)m 
any commonly available computer tiiat has a connection to tiie Intranet and 
commonly available software known as a standard Web Browser. The clieait 
typically comprises software that is downloaded from tiie server to the user's 
machine and then operates vwthin the user's web browser. The server andthecUent 
then communicate with each other tiirou^out the use of the system. 

Client 2120 of Fig. 21 may, for example, comprise software written 
in tiie Java, JavaScript and HTML languages. The client software is typically 
constructed and operative for communicating witii tiie server and for providing the user 
interface, which involves flaying information to ihs user and getting information 
fiom the user. 

Server 2110 of Fig. 21 typically provides most of the ftmctionality of 
tiie Systran. The server typicdly comprises tiie following interacting fimctional 
blocks, as shown in Fig. 22: Content Service 2210, Portfolio Service 2220. Each of 
ffiie fimctional blocks Vi*ich typically make \ip tiie server is nowdesoribed in d^: 

PortifoKo Service 2220 of Kg. 22 is typicaUy constructed and operative 

jfor interacting wtiithe cUent 2120 (Figi 21) (which in turn interacts wifli tiie 
user). The portfolio sowce transfers information between tiie client and flie 
otiira/ components of tiie system. The portfolio service typically comprises flie 
following interacting subunits, as illustrated in Fig. 23: Portfolio Database 2320, 
PortfoUo API 2310, PortfoUo Interfiices 2330, 2340, 2350, 2360. Each of tiie above 
subunits is' now desoibed in detail. 

Portfolio Database 2320 of Fig. 23 typically stores all ttie information 
about specific users of the system and tiieir portfolios, including tiie 
organized hierarchy of portfolios, folders, and information sources, as well as 
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usemames and passwords, and infonnation about when specific users access 
specific information sources. 

Portfolio API 2310 of Fig. 23 typically accesses the infonnation in 
the portfolio database 2320 and conununicateswidi the content service 2210 (Fig. 21), 
as well as wifli the portfolio interfaces 2330-2360, The portfolio API allows 
adcfitional customized interfaces to &e system to be created 

Portfolio Interfiaces 2330-2360 of Fig. 23 typically interact with the 
portfolio API 2310 and handle communication with the client 2120 (Fig. 21). 
Different portfolio inter&ces interact with different cli^ts. Examples of portfolio 
interfaces include, but are not limited to: the standard, graphical web inter&ce 
2330, a text interfEice 2340, a WAP inter&ce 2350, oAer customized interfaces 2360. 

Content Service 2210 of Fig, 22 typically accesses the information 
soxirces, stores the information, and performs most of &e functionalities of the 
system described above, typically including search, vwrtch, xq)date check, 
information access, picture rendaing, functions and analysis, archiving. The contmt 
service comprises the following functional units, as shown in Fig. 25: Content 
Database 2595, Scheduler 2550, Rules Engine 2530, Content Worker 2520, 
Content Retriever 2510, Content Converter 2590, ELA engine 2580, Content 
Identifier 2570, Picture Renderer 2560, Alerts Notifier 2540. Each component of the 
system is typically implemented using a prioritized queue with multiple workers 
processing requests from the queue. This provides robustness (if a worker dies while 
processing a request, the request will be reassigned to another worker) and 
scdabHily (more workers can be added to handle greater load). 

The internal control format of the system is typically a rule. Rules dkect 
the operation of the various components of the Content Service 2210. Rules are sets 
of instructions tiiat cause the various components of the Content Service 2210 to 
perform certaia actions are specific times. Rules are stored in the Content Database 
2595 and processed by the Rules En^ne2530. 

The internal data format used by the content service typically 
comprises a document A document typically comprises a root file and all the files 
that it contains (such as images and CTibedded documents), as well as all the 
filestfaatthe contained files contain recursively. Adocummt can comefiiom an 
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outside source or be generated internally by the rules engine from zero or more 
other input documents. Each document also typically has a time stamp describing 
when it was retrieved by the context retriever or when it was created by the rules 
engine 2530. The time stamp can be used to chronologically order documents from 
the same source. 

Fig. 24 is a flowchart indicating a typical order of operations of the 
various components of the content service 2210. For example, vdien performing an 
update check, the rules engine 2420 is triggered to begin operation by a pre-scheduled 
event m the scheduler 2410 (i.e. run the rule "update check" on the CNN site every 
two minutes"). The rules engine then dhects tihie contrat worker 2430 to <Urect the 
content retriever 2440 to fetch a specific set of content (the current contents of flie 
CNN site). The content converter 2450 then typically converts the retrieved 
information into the internal format used by the system. The ELA engine 2460 then 
uses any relevant ELA descriptions to identify specific parts of the content. The 
content identifier 2470 removes certain msignificant content, such as advertisements 
and dates. The update check rule may then be run to determine if any new information 
is present The content is then rendered into a picture by the picture renderer 2480. 
The alerts notifier 2490 communicates relevant information to the user through one 
of the notification channels available to the system. 

The various fiinctional units of the content service are now 
described in detail with reference to Figs. 24, 25 and 26: 

Content Database 2595 of Fig. 25 typically stores all documents and 
rules maintained in the system, as well as schedtiling information concerning 
vAien specific rules shoiild be run and how. (For example, the "check if the current 
stock price is bdow 30" rule is scheduled to run every 15 minutes,) This scheduling 
information originates from the user and is stored in the content database 2595 by 
the portfolio service 2220. 

Sdieduler 2550 of Fig. 25 typically reads sdieduling mformation from 
the cozitent database 2595 and invokes rules to be run in a pre-specified &shion at 
pre-specified times, intervals, or concfitions. 

Rules engme 2530 of Fig. 25 ^ically directs the operation of tiiie 
ofiier components within the content service 2210. The op^:ation of th6 system is 
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therefore customizable by inodifying the rules. The rules engine 2530 has a 
scripting language interpreter wifli a set ofbtdlt-in rules, as well as an application 
programmer interface (API) for adding fiirtha: customized rules. There is also a 
mechanism for the rules engine to communicate with tiie other components of the 
system. 

Content Worker 2520 of Fig. 25 is typically constructed and operative 
for drivmg the operation of the 2510 content retriever, which m turn gets all the 
files related to a single document The content woricer 2520 recursively parses 
through a document stored in the content database 2595 to get a list of 
contained files, and directs the content retriever 2510 to get all the files &om the 
appropriate information source. 

Content Retriever 2510 of Fig. 25 typically gets a single file at a time 
from an external source, as directed by the content worker 2520. It implements 
caching to reduce bandwidth consumption. It deals with automatically 
logging in to sites that require a usemame and password. 

Information sources are preferably checked by the content retriever 
2510 if they are included in one or more user portfolios. This is useful in that it 
provides a high level of monitoring service to individual users vdiileat the same 
time optimizing the bandwidth load for the organization as a whole, i.e. Instead of 
rhany users all individually accessing a certain information source, the system 
polls the information source once and notifies each of the users of the relevant 
Moimadon. This can reduce the bandwidth load for the organization as a whole. 

The frequency of cheddng an information source ihay be determined 
according to a number of relevant fectors, including, but not limited to: 

1. User-specified pnorities for monitoring the information source. 

2. PresOTce of the information source in multq>le user portfolios 

3. Information source response tunes 

4. Information source update firequencies 

A particular feature of flie content retriever, according to a preferred 
embodiment of the present invention, is that it optimizes use of bandwdth for 
maintaining relatively up-to-date versions of multiple mformation sources for use 
by multiple users, accoiding to fiie content retriever factors shown and desoibed 
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herein. 

Content Converter 2590 of Fig. 25 typically converts tibe files received 
in various formats into one common internal format (for example, XML), so that the 
other parts of the system may use them. The content converter 2590 has various 
modules for dealing with different file formats. Examples include, but are not 
limited to, MSWORD, PDF, etc. 

Content Identifier 2570 of Fig, 25 typically identifies (and optionally 
removes) specific portions of a document, such as ads and dates, according to 
pre-specified or user-entered identification filths in the system. The content 
identifier may be used to distinguish between significant and non-significant 
changes to content when perfomaing monitoring, as described in the monitoring 
section above. 

Preferred operation of the content identifier is described in Fig. 27 and 
typically comprises the following steps: 

Step 2710: The content identifier reads in a document firom the content 

database. 

Step 2720: The content identifier uses a set of stored "regular 
expressions" (stored in an identifier database) to check for any dates in the document 
and optionally removes the matching text 

Step 2730: The content identifier uses a set of stored URLs 
(stored in an identifier database) to check for any advertisements in the document 
The URLs are those of common commercial adv^tis^ent providers. 

Step 2740: The content identifier removes the structural element 
surrounding thie matched advertisement URL in the document Tliis removes the 
advertisement itself. 

Step 2750: The content identifier ou^uts the filtered document to the 
content database 2595. 

ELA (Element Level Access) Engine 2580 of Fig. 25 is typically, 
constructed and operative for parsing a document received from lan information 
source and extracting the specific portion that a user has described using tiie ELA 
interface described in Fig. 20. The ELA engine 2580 relies on an dement 
description created by the user using the ELA interface (Fig. 20) to extract the 
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appropriate information, which it then puts into a new document. 

An ELA description is a piece of text that describes a specific part of 
an HTML document The goal is to describe as generally as possible the specific part 
(element) of a document, so that that clement can be used for monitoring, 
searching, matching, display, notification, or other purposes within the system. An 
ELA description may include contextual cues that may be used to help furdier 
describe the desired part of the document 

An HTML document can be described as a tree-like structure of 
different elements. HTML elements used by the ELA system include, but are not 
limited to: image, phrase, table, sub-table, line, caption, cell, row, column, item, list, 
paragraph, and frame. The sfructure is mostly a tree. The root elenient is a fiitme, 
and each element may contain one or more other elements of varying types. For 
example, a cell may contain paragraphs, a line may contain phrases and images, and 
a fi-ame may contain paragraphs. It should be noted that the structure is not a 
proper tree because a tablemay be viewed as containing rows, columns, or cells, 
whereas the rows and columns themselves contain the same cells, each of vs^di is 
in both a row, a column, and a table. 

An ELA description is represented in XML and is described by an 
XML Schema. An example of a suitable XML schema is as follows: 

<!- $Id: ela,xsd,v 1.4 2001/02/28 09:16:1 1 marc Exp $ 

<!- defaults: 

minOccurs="l" maxOcGurs="l" 
— > 

<schema xmhis="http://www, w3.org/2000/10/XMLSchema" 

xmhis:xsi="http-y/www.w3.org/2000/10/XMLSchema-instance" 

xmhis:ela-"http://www.broadfufe.com/xmlsch^ .0" 

taigetNamespace="http://www.broadfire,com/xmkchemas/eW^ 
<!- noNameq)aceSchemaLocation===^*XMLSchaiiajcsd" r-> 

<element name="ela" type="ela:elaType" l> 
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<!- this is mostly for testing -> 
<eiement name="elalist"> 
<complexType> 
<sequence> 

<eleinentref="ela:ela" maxOccurs- 'unbounded" l> 
</sequence> 
</complexType> 
</element> 

<fcomplexType name^"elaType"> 
<sequence> 

<element name="match"> 
<complexType> 
<choice> 

<group ref="ela:matchElement" /> 
<group ref="ela:filterElement" > 
</choice> 
</complexType> 
<yelement> 

<element name="upiever* typ6="nonNegativeInteger" 
i)iunOccuis="0" t> 

<elementname="jBlter" minOccur^"0" 
maxOccuis="unbounded"> 
<coniplexType> 
<sequCTice> 

<elem«it nam^"context"> 
<compIexType> 
<groi?) ref="ela:£QterElement" l> 
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</coinplexType> 
</elemeat> 

<element name="choose" niinOccurs=**0"> 
<GomplexType> 
<choice> 

<choice maxOccurs="imboimded"> 

<element naine=''position"> 

<complexType> 
<simpleConteiit> 
<6xtension base="integer"> 

<attribute name^"relop" type="ela.Telop" /> 
<7exteaision> 
</simpleConlent> 
</complexType> 
<yelement> 

<el©Qient iiain^'after"> 
<complexType> 

<choice> 

<group ref="ela:imageMatch" /> 

<group ref="ela:textMatch" /> 
</choice> 

<attribute naine="skip" type^"nonNegativeInteger" f> 
<!- XXX this should be a positivelnteger t 
or "unbounded" -> * 
<attribute naDgLe="couat" typc="strmg" /> 
<attribute naine="range"> 
<simpleType> 
<restriction base===''string''> 
<enumeration value="inclusive*' A> 
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<enumeratioii value^"exclusive" /> 
</restrictioiP> 
</simpleType> 
</attribute> 
</complexType> 
</elemeiit> 

<eleinent name="before"> 
<fcomplexType> 
<choice> 

<group ref="ela:imageMatch" /> 
<group ref="ela:textMatch" l> 
</choice> 

<attribute name="skip" type="nonNegativeInteger" />. 
<!— XXX this should be a positivelnteger 
or "unbounded" — > 

<attribute name="count" type="stiing" l> 
<attribute iiame="xange"> 
<simpleType> 
<restriction base="string'*> 
<enumeration value=*'inclusive" /> 
<enumeration value="exclusive" t> 
<i^iestrictioii> 
</simpleType> 
</attribute> 
</complexType> 

</element> ^ 
</choice> 

<element name="triangulate"> 
<compIexType> 
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<sequence> 
<element name=*'row"> 
<complexTyp^ 
<choice> 

<group ref="ela:imageMatch" t> 
^roup i:ef="cla:textMatch" t> 
<ychoice> 
<ycomplexType> 
<element> 

<element name="coluinn*> 
<complexType> 
<choice> 

<group rejN"ela:imageMatch" t> 
<group ref="ela:textMatdi" t> 
</choice> 
</complexType> 
</eIement> 
</sequence> 
</complexTyp6> 
</elemcnt> 



</choice> 

</complexType> 

<elemmt> 

</sequence> 
<ycompIexType> 
<7elemenP- 



<seqiience> 
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</complexType> 

<group naine="filterElement**> 
<choice> 

<!~ canonical orda: mibin a type is 

<tex£> <image> <select> 

Not all sections will appear within all types, 

<element name="line"> 
<cpniplexType> 
<sequence> 

<group re^="ela,-textMatch" /> 
<group ref=**ela:imageMatch" /> 

</sequence> 
</complexType> 
</element> 

<eleinent name=''caption*'> 

<complexType> 

<sequenpe> 
I 
% 

<gmp te5="eIa:imageMatdi" /> 
</seqifcnce> 
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</complexType> 
<;/element> 

<elemeat name="ceU"> 
<complexType> 
<sequence> 

<groi5) ref='^ela:textMatch'' l> 

<group ref=''ela:iinageMatch'' l> 

</sequence> 
</complexType> 
</element> 

<elemeat name="row"> 
<complexType> 
<sequence> 

<group ref="ela:textMatch" /> 

<groi5) reiN"eIa:imageMatch" t> 

</sequence> 
<ycomplexType> 
</element> 

<element nameF=**column"> 
<complexTyp^ 
<sequence> 

<^oup re^"elanextMatch" /> 
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<group ref="ela:imageMatch" t> 

</sequence> 
</complexType> 
<yelement> 

<element name="table"> 
<compIexType> 
<sequence> 

<group re^"ela:textMatch" t> 

<feioup ref="ela:imageMatdi" t> 

<choiceininOcciirs="0'' maxOccurs="imbounded"> 

<element name="iows"> 
<complexType> 

<SiinpleConteiit> 

<extension base="positiveInteger"> 
<attribute name="relop" typeF="ela:relop" t> 

</extension> 
</simpleCoiiteQt> 
<ycoinplexType> 

</element> 

<element nameF="columns"> 
<coinplexType> 
<siinpleContent> 

<fextension base="positiveIiiteger"> 
<aftribute name="relop" type="ela:ielop" t> 
</extension> 
</simpleContent> 
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</compIexType> 
</element> 
</choice> 

<element name="select*' minOcciirs="0"> 

<complexType> 
<attribute nameF="type"> 
<siinpleType> 
<restriction base="string"> 
<enumeration value^"first" l> 
<enumeration value^"last" hp- 
<enumeratiGn value="widest" l> 
<eniimeration valne="tallest" /> 
<enumeration value="largest" l> 
</restriction> 
</simpleType> 
</attribute> 
</complexType> 
<yelement> 

<ysequenGe> 
</complexType> 
</elCTient> 

<felement iiame="item"> 
<complexType> 
<^uence> 

<groirp ref="ela:textMatch" /> 

<groiq) ref="ela:iniageMatdi'' t> 
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</complexType> 
</element> 

<element name="list"> 
<complexType> 
<sequence> 

<grQup ref="ela:textMatch" t> 

<group ref="ela:imageMatch" l> 

<eleinent name="items" ininOcciirs="0" 
maxOccurs="unbounded"> 
<complexType> 
<CsimpleContent> 

<exteasion base="positiveInteger"> 
Ottribute name="relop" typeF="ela:relop" l> 
</extensioit> 
</simpleConten1> 
<;/complexType> 
</eiemen^ 

<element name="select" minOccursF*'0"> 
<complexType> 
<attribute name="type"> 
<SimpleType> 
<restriction base="string*'> 
<enumeration value="longest" l> 
<yrestriction> 
</siinpleType> 
</attribute> 
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</complexType> 
</element> 

</sequence> 
</complexType> 
</element> 

<element name="paragraph"> 
<complextype> 
<sequeiice> 

<group ref="ela:textMatch" l> 

<groq) ref="ela:imageMatch" t> 

</seqifcnce> 
</c6mpIexType> 
</elenient> 

<element nam^"frame"> 
<complexType> 
<Sequence> 

<group ref="ela:textMatch" l> 

<group ref="ela:imageMatch" /> 

</sequence> 
<ycompIexType> 
<yelement> 

</choice> 
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</group> 

<group name="niatchElement'*> 
<choice> 

<element name="image"> 
<complqcType> 
<sequence> 

<group rejN"ela:imageMatch" t> 

<eleinent name="select** minOccurs="0*^ 
<complexType> 
<attribute name="type"> 
<simpleType> 
<restriction base="string"> 
<enumeration value="first" l> 
<enumeration value="last" l> 
<enumeration value="widest" t> 
<eniimeration value="tallest" l> 
<enumeration value="largest" l> 
<yrestriction> 
</simpleType> 
<yattribute> 

<ycomplexType> 
•</element> 

</sequ6nce> 
</complexType> 
</element> 

<elemeiit naine="phrase''> 
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<compIexType> 
<sequence> 

<group ref="eIa:textMatch" l> 

<elemeat name="urr' type="string" 
mmOccurs='*0'' maxOcciirs='*unbouiidcd" t> 

</sequeiice> 
</complexType> 
</element> 

<element name="subtable"> 
<complexType> 
<sequeiice> 

<group ref="ela:textMatch*' /> 
<group ref="ela:imageMatch" t> 
</sequence> 

<attribute name="left" type="integCT" t> 
<attribiite name="right" type="integer" l> 
<attribute name="top" type="integer" t> 
<attribute name="bottom" type="integer" l> 
</complexType> 
<yeleinent> 

<choice> 
<ygroup> 

<group iiame?="imageMiatch"> 
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<sequeiice> 
<element name="image" minOceurs="0" 
maxOccurs="unboimded"> 
<complexType> 
<choice inaxOccurs=="mbounded*'> 
<element name="width"> 
<complexType> 
<simpleContent> 
<extension base="nonNegativeInteger*'> 
<attribute name=="relop" type="ela:relGp" l> 
</extension> 
</simpleContent> 
</complexType> 
</element> 

<element name="height*'> 

<compIexType> 
<simpleContent> 

<exteiisioii base="nonNegativeInteger"> 

<attiibute name="relop" type="ela:relop" > 

</extensioii> 
<ysimpleContent> 

<ycomplexTyp^ 
</elenimt> 

<element name='*src" type="striiig" t> 
<element name="alt" ^rpe="string" /> 

</choice> 

</complexType> 
<yelement?> 

</sequCTice> 
<group> 

<^roiip iiame="tex(Matdi*'> 
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<sequence> 
<elemeiit name="text" mmOccurs="0" 
maxOccurs="unbounded"> 
<complexType> 
<choice maxOccurs="unboiinded"> 
<element jiame="contains" type="string" l> 
<element name="fece" type=**string" t> 
<element name="color" type="stiing" /i> 
<element nameF="foiit-famfly" type="string" l> 
<element name="size" type="positiveInteger" t> 
</choice> 
</complexType> 
<elemmt> 
</seqiience> 

</group> , 

<^pleType name="i:cIop*'> 
<restriction base="string"> 
<enumeration value="eq" t> 
<enumeration value="lt" /> 
<eniimeration value="gt" l> 
<entimeration value="le" l> 
<enumeration value="ge" /> 
<eniimeration value="ne" A> 
</restriction> 

<ysimpleType> ^ 
<7scheind> 

Each £LA descnption typically comprises one, some, or all of ^e 
foUowii^ fliree parts: 
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1. The first, main part is a <match> tag that describes the desired 

element. This tag typically describes the element to match as precisely as possible 
without taldng into account &e context around the element, but focusing instead on 
the contents of the element itself. An element may be described by the type of the 
element and by a combination of text contained m the element, images contained 
in the element, characteristics of the element itself (for example, for an image, the 
source URL of theimage)^ 

2- The second partis an <uplevel> tag stating the number ofuplevels 

to use when matching. An uplevel typically describes a situation where an element 
is contained within another element of a similar type. For example, witii an uplevel of 
0, a description could describe "the cell containing the words 'Last Trade"'. With an 
uplevel of 1, a description could describe "the cell containing the cell containing the 
words 'Last Trade'", etc. TTie default uplevel is 0. The semantics of this are 
described in the algorithm below. 

3, The third part is a list of <filter> tags. Each filter typically 

describes a property of the element or of its surroundings. Filters may be used in 
series to filter out multiple potential matches in order to ultimately identify the 
single desired element Filters may be based on descriptions of the element's 
context, comparisons between multiple matching candidate elements, as well as 
the location of the element relative to other elements in the document Three 
types of filters are now described: cont^ filta:s, comparison filters, and location 
filters. 

A. Context Filter - A context filter desc^bes the desired element 
according to the properties of an element that contains it For example, a matdi tag for 
"a cell that; contdns the text "Last Trade' may be used in conjunction with the 
filter "contained in a table that has the text "RHAP". (see example below) 

B. Comparison filter - A comparison filter is based on a comparison 
between multiple matching candidates. For example, a match tag for "any image" 
may be used in conjunction with the filter **tfae largest of all the images'*. 
Comparison fih^ include, but are not limited to: largest, smallest, tallest, widest, 
first, last 
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C Location filters - Location filters may be used to identify a desired 

element or group of elements ("the desired element") fix)m within a set of elements 
that are contained in a larger element ("the context") Location filters include, but are 
not limited to position location filters and before-and-after-location filters, each of 
A^ch is described below. 

Position Location Filters: The position filter may be used to identify a 
desired element within a context, according to the position of the desired element 
within the set of elements that are contained in the context Examples include, but 
are not limited to: In a context containing ten cells, "^e 3rd cell", "the first two 
cells", "the thkd through fiflh cells" "tiie second tiirough tiuxd-frdm-last cells", "the 
last four cells", etc. 

Before and After Location Filters: The desired element is identified 
by its position relative to another, more, easy-to-identify element Q*ib& anchor") also 
located in the context. EXAMPLE: the context is a column of cells. The desired 
element is a particular cell within the context that contains constantiy changing text 
(e.g.. breaking news stories) and is therefore difficult to describe according to the 
text that it contains. The anchor is a cell immediately preceding the desired element 
fliat always contains the text "Today's Breaking News". An "after" filter maybe 
used to create the description "the cell that is one element after the cell that contains 
the text Today's Breaking News'". Before and After filters may specify an anchor 
description, a skip distance (e.g. "beginning one after tiie anchor, two after, 
etc."), and a spanning length (how many elements to include, e.g "select the three 
cells that begin one after the cell containing the text "Today's Breaking News"). 
Before and After filters may be used in conjunction with one another to describe a 
specific range of cells. 

An example of an ELA description is found below. The example 
describes tfie desired cell pictured in Fig. 19. The HTML document includes a 
set of tables containing various stock quotes. The user is interested in the "Last 
Trade" price of flie stock "RHAT". The user flius indicates that the desired element 
is "tiie cell contammg the text 'Last Trade'". However, since flieie are multiple 
stocks rq)orted in this document, a context filter uses the context of the containing 
table to desCTibe the desired element The fiill description thus reads: "the cell 
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containing the text 'Last Trade* in the table that contains the text 'RHAP" 

Here is an example of an EL A description: 
<ela:6la> 
<match> 
<cell> 

<textXcontains>Last Trade</contdnsPX/text> 
<yceU> 
</match> 

<uplevel>0</uplevel><!— default, may be omitted -> 
<filter> 
<context> 
<table> 

<text><containsi>RHAT</contaiDS></text> 
</tabIe> 
</context> 
<yfilter> 
</el2ueIjP^ 

The first part is a <match> tag that describes a cell. The cell described 
is any which contains tiie text ''Last Trade". The next part is the i5)level, which is 0. 
The thiid part is a <filter> tag that describes a single containing element ITie 
containing dement is a table, v*5di contains the tsxt "RHAP'. Given an HTML 
document and an ELA desdiption, a process by which Ac system may identify the 
desired element is now described. Definitions and variables pertaining to a preferred 
process are first described, followed by a description of flie steps a - e vMch the 
process preferably comprises. 

Definitions: 

A "minimal set" of matdies is one in whidi no elemait contains 
another element in the non-minimal set this avoids ambiguities in certain cases. 
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Picture Renderer 2560 of Fig. 25 creates a graphical image from a 
document, which may be used in the folder view part of the user interface (Fig. 2). 
A preferred method of operation for the Picture Renderer 2560 is described in Fig. 
28 and preferably includes the folloAving steps: 

Step 2810: The picture renderer 2560 reads in a document from the 
content database 2595, 

Step 2820: The picture renderer 2560 identifies the document 

structure. 

Step 2830: The pictute renderer 2560 creates a geometric description of 
a document based on the structure. 

Step 2840: The picture renderer 2560 creates a picture based on 
the geometric description. 

Alerts Notifier 2540 of Fig. 25 typically sends a document to the 
user, via any of a ntimber of services. Examples include, but are not limited to email, 
sms,frix, and Instant Messenger. 

The internal representation of an ELA description shown and 
described herein allows the system of the present invention to handle a high level of 
resolution, including cells and rows, grouping of contiguous/non-contiguous elements, 
flexible descriptions of elements based on a combination of multiple internal 
properties, and multiple relationships to other elements. A particular advantage of the 
preferred internal representation shown and described herein is that it allows the system 
to identify the desired elements consistently within a changing document, eVen in 
the face of other elements in the document that contain many similarities and/or certain 
modifications to the stmcture and content of the document 

The following example work-sessions describe how an end-user may 
use the sy stem of the present invention to benefit from some of its fimctionalities. The 
user in the example is an employee at a financial services organization. The 
following example work-sessions are described: Portfolio creation. Accessing 
information, Searching and watching. Archiving, Groups and sharing, Functions and 
analyses. 
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Example I: Portfolio Creation Worksession 

Using a graphical user interface, a user, John Doe, creates a portfolio 
when using the system for the first time. This involves entering the user name and 
password that will be required for the user to access his portfolio. The user also 
enters information that the system may use to communicate with the user over certain 
notification channels (like email, pager, fax, etc.). 

The user is assigned a new, empty portfolio — one that contains no 
folders and no information sources. Using a grz^hical user interface, the user adds 
new folders to his portfolio. For example, the user creates a folder named 
"Releases", which he intends to populate with information sources, such as 
\vebsites titot contain press releases of companies in which he is interested. 
The usor also creates a Folder named "Stocks", vsdiich he intends to populate with 
information sources related to the stocks in which he is interested 

Using a graphical user interface, the user then adds information 
sources to the folders that he has created. Forexample^ ttie user adds the web sites 
listing the up-to-date press releases of certain corporations to the "Releases" folder. 
Either these sites contain solely press releases, or the user may use Elaement Level 
Access to specify flie specific parts of the web pages fliat contain the press releases. 
The user also wishes to select a stock price fix)madocumwt that contains a 
list of stock prices. Using the gnq)hical user interfece described above in the 
section "ideritifyiEB Information Sources Witiiin Documents", the user selects 
the q>ecific stock price he isinterested in fi?om thedocument 

Example H: Accessing Information Worksesadn 

After creating the portfolio and populating it with the information 
sources of interest, the user may use the system to speed his access to the information. 
If the user did not have the system available, tiie nser would need to begui each 
work-day by using a web browser to visit each press release site individually to check 
for new press releases. Now, wth the system, the user can simply open up flie 
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"Releases'" folder that he has defined wlhin his Folder View, and instantly 
view all of the information sources miniaturized, tiled ac^ss the browser 
window. Any information sources that have changed since the last time flie user 
had checked them are indicated by a colored border. The usor migjit mstantiy see that 
only three out of nine information sources have changed. This means that the user 
does not have to check the other six that have not changed, saving the user 
significant amounts of time. 

To preview an information source, the user may invoke an 
information source preview by moving the pointing device so that the cursor is 
positioned over the name of the information source. The preview allows the user to 
seethe contents of an information source (by looking at the rendered picture of a 
version of the content that is pre^^ached on the server) witiiout having to wait to 
retrieve the information source dnectiy bom its source, saving additional time. 
The user may access an information source directiy by clicking on the pictoral 
representation of the information source in the topic window. 

Example HI: Searching and Watching Worksession 

The user now wants to know if any of the companies in the 
"Releases" foldei: have issued a press release about Aeir earning recenfly. Using a 
graphical user int^ace, auser sets rsp a search for tiie search term "Earnings" 
With the search domain being the "Releases" folder in his portfolio. The system 
perfornis the search and returnis a list of results, listing any m a tch i n g press releases. 
Using a standard search ^igihe, the usct would have had to indicate the various 
companies that the us^ is interested in searching. Usmg the present systeni, however, 
the list of companies tiiat interest the user are ahr^y in tfaesyst^ in the form of 
flie us^s portfolio. After having set iq> the portfolio just once, all the user needs to do 
is specify the appropriate folder to isearch each time a search is to bep^ormed. 
In this way, the combination offlie search feature with flhte ability of the user to 
store an organized collection of information sources on tiie system results in 
added convenience for the user. 

The user may then want to be notified at any time during the 
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following week if any of the press releases appearing over that period relate to 
corporate earnings. The user therefore sets up a watch, similar to the previous search, 
with the duration set to one week. In this example, the user specifies fax 
notification. Sometime later that week, a new press release relating to earnings 
appears on one of the information sources included in the "Releases" folder. Soon 
thereafter, tlie system notices the matching press release, and communicates the 
results to the user on the user's fax machine. 

Example IV: Archiving Worksession 

The user wants to store the content of an information source for later 
reference, for example one of the press releases spearing in an information source 
in the "Releases" folder. Usuig £t graphical user interfece, ' the user archives Ihe 
content of interest. At a later time, flie user may access the archive througji 
mode 4 of the topic window representing ttie information source. This information 
will th«i be available to the user even if it is no longer stored on the original 
information source. 

Example V: Groups and sharing Worksession 

The user wants to share his information with a number of 
colleagues. Using a graphical user interface, the user sets up a group named 
"colleagues" that includes tiie login names of the various colleagues. The user may 
then share various parts of his portfolio with the "colleagues" group. 

For example, the user may make his "Releases" folder available to the 
group. The various users in tiie group may then import the "Releases" folder into 
their own portfolios. One user m the group can then create an archive for die I)enefit 
of another - for example when another user is absent during the period of timettiat 
a specific piece of content is available on an information source. Users can also discuss 
developments in the press releases usmg notes. When a new notes is created by 
another userm the group, a graphical indication appears on the notes on a user's 
red The notes are accessible tiffoughmode 2 of tiie topic window represmting 
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the information source. One user can set a watch in which other users m a group 
will be notified when a result matches. 

Example VI: Functions and Analyses Worksession 

The user may configure the system to p^form certain analyses on the 
information contmned in the portfoUo. For example, the user nmy direct the system 
to notify him eveiy time a stock price goes above a certain value. 
Alternatively, the user may direct the system to automatically archive the 
contents of an infonnation source every time a press release with the words 
Earnings appears. 

It is ^predated that the software components of the present invention 
may, if desired, be implemented in ROM (read-only memory) form. The software 
components may, generally, be implemented in hardware, if desired, using 
conventional techniques. 

It is appreciated that various features of the invention vMch are, for 
clarity, described in the contexts of separate embodiments may also be provided in 
combination in a single embodiment Conversely, various features of the invention 
which are, for brevity, described in the context of a single enabodiment may also 
be provided separately or in any suitable subcombination. 

It will be appreciated by persons skilled in the art that the present 
invention is not limited to what has been particularly shown and described 
hereinabove. Rather, the scope of the present invention is defined only by the 
claims that follow: 

} 
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CLAIMS 

1 . An information management system comprising: 
a plurality of information sources; and 

an information source previewer operative to provide a preview of the 
information sources comprising a less than complete view of at least some of the 
information sources. 

2. An infonnation management system comprising: 
at least one representations of infonnation sources; 

a graphical user interface integrated with at least one of the representations of the 
information sources; and 

an archiving system operative to allow users to time-stamp and archive at 
least one representations of infonnation sources. 

3. A system according to clahn 2 i^erein said archiving system is operative to 
allow remote archiving. 

4. A system according to claim 2 whrein said archiving system comprises an 
annotator. 

5. A system according to claim 2 wherein said gr£q)hical user int^ace allows a user 
to specify which of a pitbratity of other users can access the content .and how long 
content is to be stored. 

6. An iafoimation m^magement ^stem comprising: 

an archiving system operative to allow users to time-stan^) and archive content; 

and 

a scheduling system allowing the archiving system to operate automatically in 
accordance with a predetemdned schedule. 



7. 



A system according to claim 6 wherein the scheduling system operates the 
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arcMving system in accordance with at least one triggering nile. 

8. A system according to claim 6 wherein the scheduling system is operative to 
perform a watch function in which predefined content is watched for. 

9. An informadon management system comprising: 
a content searcher; 

a search-defining GUI allowing a user to define a search; and 
a watch-defining GUI allowing a user to define a watch at least hy automatically 
converting a previously defined search into a watch. 

10. An information management systan comprising: 
a content searcher; and 

a search-defining GUI allowing a user to define at least fireshness of search. 

11. An information management system comprising: 
a content searcher; and 

a search-defining GUI allowing a user to define at least depth of search. 

12. An information management system comprising: 
a content searcher; and 

a search-defining GUIallowmgausar to define at least duration of search. 

13. An information management system comprising: 

a(i information source manager iucluding a set of user-defined information 
sources; 

a content searcher, and 

a search-defining GUI allowing a user to define a subset of the user-defined 
information sources to be seardied. 



14. An information management system comprising: 
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a server storing user-defined foldeis, and 

a client via which a tiscr can view at least some of the user-defined folders. 

15. An information management system comprising: 

at least one representations of information sources including graphic 
representation of check-update status; and 

a check-update status maintainer operative to monitor the check-update status of 
each information source and to maintain the graphic representation of the 
check-update status accordingly. 

1 6. An mformation management system comprising: 

a search results GUI including a plurality of separate result windows for separate 
search results. 

17. An infomiation management system comprising: 

a document portion identification GUI operative to allow a user to 
graphically identify a portion of a document using a targeted set of questions; and 

a document portion processing unit operative to perform at least one process 
on a document portion defined by a user via the document portion identification GUI. 

18: A ^stem according to claim 12 wfaidi is operative to perform a search over a 
specific part of an information source. 

19. An infoimation manag^ent system comprising: 
a plurality of information management tools; 
an information source; and 

a GUI (graphic user interface) integrating the plurality of information 
management tools around the information source using a graphical representation. 
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20. A system according to claim 1 n^iierein at least one of Ae information 
sources is selectably accessed via a locally stored copy thereof rather than directly. 

21. A system according to claim 8 wherein flie scheduUng system 
performs the watch function over a user-defined set of information sources and over a 
user-defined time period. 

22. A system accordii^ to claim 8 wherein the scheduling system 
comprises a notifier operative to notify a user of •Tuts", the notifier employing any of a 
plurality of user-selectable notification modes. 

23. An information managenient system comprising: 

a watch unit operative to watch for a defined unit of information in a flow of 
infomiafion; and 
anELAunit 

24. A system according to claim 23 which is operative to perform an ongoing 
search over a specific part of an infonnation source. 

25. An information management system comprising: 
an update checking \mit;and 

anELAunit 

26. A system according to claun 25 which is operative to perform an ongoing 
update-check over a specific part of an information source. 

27. A system accor£ng to claim 17 vdierdn the document portion 
proceissmg unitisprogramniable to perform custbniized fimcMons; thereby 

user to perform custonoized processes on specific document portions. 

28. A system according to claim 14 wherein the client displays multiple 
sources simultaneously. 
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29. A system according to claim 14 v^^ein the client operates within a 
standard web browser without downloading and installmg specialized software. 

30. A system according to claim 16 wherein the search results GUI 
displays a list of results and, simultaneously, the results themselves in separate 
windows. 

31. An information management system comprismg: 

a functional unit operative to perform a plurality of selectable functions 

on information; and 

an automatic information retriever operative to automatically 
retrieve information from a plurality of information sources. 

32. A system according to claim 31 whei»in the automatic information 
retriever is selectably optative to automatically retrieve information on a 
condition-triggered basis. 

33. A system according to claim 31 vdierein multiple user-selectable 
notification methods are employed to bring system woric products to a user's attention. 

34. A system according to claim 31 and also compriang an interface 
allowing mobile access to and control of the sys^oL 

35. An information management system comprising:. 

an information source processor operative for performing user-selectable 
information management processes on any user-selectable infermation source from 
among a. plurality of information sources; and 

an ELA interface constructed and operative to allow a user to identify 
specific elements of documents as information sources. 
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36. A system according to claim 35 wherein the specific elements which sl 
user is allowed to identify include at least one of the following group: image, phrase, 
table, sub-table, line, caption, ceU, row, colunm, itein, list, paragraph, fiame. 

37. A system accor(Mng to claim 35 wherein the ELA intocfacc is operative 
to group several elements in a document. 

38. A system according to claim 37 wherein the ELA inter&ce is 
operative to contiguously group several elements in a document 

39. A system according to claim 37 wherein the ELA interface is operative, 
to non-contiguously group several elements in a document 

40. A system according to claim 35 wherem a groiq) of at least one 
elements may be identified by means of a combination of at least one internal 
properties, 

41. A system according to claim 35 \^*ierein a group of at least one 
elements may be identified by means of their relationdiips to other elements 
ha\dng a specified combination of at least one internal properties. 

42. A system according to claim 40 wherein the internal properties include 
at leaist one of the following group: contains a specified text, possesses at least one 
descriptive foimatdi^ property, contains specified markup-tag information. 

43. A system according to claim 42 wherein the at least one descriptive 
formiatting property comprises at least one of the foUovnng group of property types: 
a color property, a size property, and a style property. 

44. A system accordmg to claim 41 i^ierein sdd relationships 
comprise at least one of the following type of relationships: after, before, between, 
contained in, location in group, bigg^, biggest in groiq), first, smallest, largest 
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45. A system accoiding to claim 31 and also comprising an ELA unit 
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